Origin of the Scaling Law in Human Mobility: Hierarchical Organization of Traffic 
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Uncovering the mechanism leading to the scaling law in human trajectories is ol fundamental im- 
portance in understanding many spatiotemporal phenomena. We propose a hierarchical geographi- 
cal model to mimic the real traffic system, upon which a random walker will generate a power-law 
travel displacement distribution with exponent -2. When considering the inhomogeneities of cities' 
locations and attractions, this model reproduces a power-law displacement distribution with an ex- 
ponential cutoff, as well as a scaling behavior in the probability density of having traveled a certain 
distance at a certain time. Our results agree very well with the empirical observations reported in 
[D. Brockmann et al, Nature 439, 462 (2006)]. 
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Studies on the non-Poisson statistics of human behav- 
iors have recently attracted much attention [H, 0, Q . Be- 
sides the inter-event or waiting time distribution, the 
spatial movements of human also exhibit non-Poisson 
statistics. Brockmann et al. 0] investigated the bank 
note dispersal, as a proxy for human movements, and 
revealed indirectly a power-law distribution of human 
travel displacements. Gonzalez et al. [5] studied the 
human travel patterns by measuring the distance of mo- 
bile phone users' movements in different stations, and ob- 
served a similar scaling law. Actually, the mobility pat- 
terns of many animals also show power-law-like displace- 
ment distributions [6|, [7|, [8j . The ubiquity of such kind 
of distributions attracts scientists to dig into the under- 
lying mechanism. Some interpretations, such as optimal 
search strategy 0, [ip| . olfactory-driven foraging [ll| and 
deterministic walks [12| , have already been raised for the 
power-law displacement distribution in animals mobility 
patterns, however, they are based on the prey processes 
and thus cannot be used to explain the observed scaling 
law in human trajectories, which is still an open problem. 
In this paper, we propose a model to mimic the human 
travel pattern, where the hierarchical organization of the 
real human traffic systems is taken into account. Our 
model can reproduce the power-law displacement distri- 
butions, as well as the scaling behavior in probability 
density of having traveled a certain distance at a cer- 
tain time, agreeing very well with the empirical results 
reported in Ref. [J]. 

Let's think about the real human traffic systems. Gen- 
erally speaking, a district (e.g., a province or a state) 
usually has a core city, like its capital; around this core 
city, there are several big cities as the secondary cen- 
ters (e.g., municipalities); then, each of these centers is 
rounded by some counties; and towns and villages will 
surround each of the counties. A hierarchical traffic sys- 
tem is built accordingly. Imaging people traveling from a 
town, a, subordinating to the central city, A, to another 



town b that is subordinated to the central city B. There 
is usually no direct way connecting a and b, and the typ- 
ical route is a — > A — > B — > b. This kind of hierarchical 
organization is not just inside a country or a district, but 
across the whole world. For example, if one wants to 
travel from the University of Science and Technology of 
China to the University of Fribourg, there is no direct 
way connecting Hefei and Fribourg, instead, one has to 
follow the route Hefei— > Shanghai^ Zurich^ Fribourg al- 
though it is much longer than the geographical distance 
between Hefei and Fribourg. Such a hierarchical orga- 
nization as well as the resulting scale invariance in road 
networks have already been demonstrated recently fl3j ] . 

For simplicity, we call all the units cities. In our model, 
cities are organized in N layers. A uniform 3-layer system 
is shown in Fig. 1, in which, 81 cities locate on the centers 
of a 9 x 9 lattice. The most central city is put in the first 
layer, and the whole region is divided into 9 sub-regions, 
each contains a 3 x 3 lattice. Except the middle sub- 
region, all other eight sub-regions have their own central 
cities, namely the second layer cities, which locate at the 
centers of those sub-regions. Meanwhile, there are eight 
third layer cities around each of the second layer cities as 
well as the first layer city. An illustration is shown in Fig. 
1. Denote N the number of layers and K the number of 
first layer cities. We assign M sub-regions to each of the 
K lst-layer cities, with the lst-layer city locating in the 
center, and [M — 1) 2nd-layer cities are respectively put 
in the remain (M — 1) sub-regions. Each of the KM sub- 
regions is further divided into M sub-sub-regions, with 
the 1st- or 2nd-layer cities locating in the center and 
(M— 1) newly generated 3rd-layer cities put in the remain 
{M— 1) sub-sub-regions. Repeating this process until the 
A^th-layer cities are generated. For 2 < n < N, there are 
KM n - 2 (M-l) nth-layer cities. Note that, in this model, 
to make sure the lattice is fulfilled with cities, M must 
be equal to (2q — l) 2 where q is a certain integer larger 
than 1. 
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FIG. 1: (Color online) Illustration of the hierarchical struc- 
ture and moving rules in the present model, (a)-(c) show the 
connections among the different-layer cities (black lines) and 
the same-layer cities (blue lines). Where the green, red, and 
black circles denote the cities in 3rd, 2nd and 1st layers, re- 
spectively, (d) gives a typical trajectory of a walker moving 
from a lower-layer city to a higher-layer city in other sub- 
region. A fractal network model with similar motivation can 
be found in Ref. 11311. 



The K first layer cities are fully connected with each 
other, each of which is connected with the nearest (M— 1) 
TVth-layer cities and the nearest (M — 1) second layer 
cities. Each of the second layer cities is connected with 
the nearest (M — 1) iVth-layer cities, the nearest (M— 1) 
third layer cities, as well as the other (M — 2) second layer 
cities belonging to the same first layer city. Actually, for 
1 < n < N, each of the nth-layer cities is connected with 
the nearest [M — 1) TVth-layer cities, the nearest (M — 1) 
(n+l)th-layer cities, as well as the other (M— 2) nth-layer 
cities belong to the same (n — l)th-layer city. Note that, 
all the connections are symmetry, and the direct move 
between two cities is allowed only if they are connected. 
Figure 1 illustrates an example with N = 3, K = 1 
and M — 9. The modeled system can be viewed as a 
hierarchical network. Different from the real hierarchical 
networks [3| or the mathematical models [Tol . RH ] , it is 
hierarchical but not scale-free. 
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FIG. 2: (Color online) Burstiness of long-range travels. Pa- 
rameters are set as N = 5, M = 9, and K = 9. We 
choose N = 5 because it is typical in the real world, such 
as province-city-county-town-village in China and region- 
department-arrondissement-canton-commune in France, (a) 
The sequence of the displacement, L, of a random walker in 
10 4 consecutive steps, (b) The walker's trajectory in 2000 
consecutive steps, (c) The distribution P(L), where the dash 
line, as a guide for eyes, is of slope -2. The data in panel (c) 
is obtained by averaging over 100 independent runs, and each 
lasts 10 5 steps. 



We consider the simplest case where a random walker 
is consequently moving from one city to a random neigh- 
boring city (two cities are said to be neighboring if they 
are connected). Figure 1(d) shows a typical trajectory of 
a walker moving from a lower-layer city to a higher-layer 
city in other sub-region. The displacement of the walker 
in one step is defined as the geometric distance L, and the 
distribution of L is what we mainly concern in this pa- 
per. As shown in Fig. 2, burstiness of long-range travels 
is clearly observed and the distribution of travel displace- 
ment, P(L), approximately obeys a power-law form with 
exponent -2. 

The essential physics of this model is a random-walk 
process in a geographical network where edges are of dif- 
ferent geometric distances. For a random walker in a 
connected symmetry (undirected) network, in the long 
time limit, each edge has the same chance to be visited 
(this proposition is hold even for a very heterogenous net- 
work, since for an arbitrary node, the number of times 
being visited is proportional to its degree while the prob- 
ability that a specific adjacent edge of this node is con- 
secutively visited is inversely proportional to the degree. 
Details can be found in Ref. [17j)- Therefore, the dis- 
placement distribution of a random walker is equivalent 
to the distribution of edges' geometric distances. Let d n 
(n > 1) denote the average geometric distance of edges 
connecting two nth-layer cities (they must belong to the 
same (n — l)th-layer city) and an nth-layer city and an 
(n — l)th-layer city (the former belongs to the latter), 
and m n denote the total number of edges contributed 



to d n - Obviously, for 1 < n < N, d r , 
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FIG. 3: (Color online) Illustration of a typical five-step move- 
ments in a 3-layer inhomogeneous model, where red, blue, 
and yellow respectively denote the Ist-layer, 2nd-layer and 
3rd-layer cities. For clarity, we do not plot the connections 
between cities in the same layer. 
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FIG. 4: The trajectory of a walker in 2000 consecutive steps 
in the heterogeneous model with parameter setting N — 5, 
M = 9, K = 9, and S = 100. (a) and (b) respectively 
correspond to the cases of r — 1 and r = 2. 



M, N and djv can be considered as constants, and thus 
we have d n ~ M~ n / 2 . On the other hand, for n > 1, 
m„ = KC 2 M M n - 2 where C 2 M = M(M - l)/2. That is, 
m„ ~ M™. Roughly speaking, d n and m„ play the roles 
of geometric distance and the number of edges associated 
with such a distance. As we have already obtained the 
scaling m n ~ d~ 2 , we can deductive that the displace- 
ment distribution for a system with sufficiently large N 
and in the long time limit obeys the scaling P(L) ~ L~ 2 , 
which is in accordance with the simulation result shown 
in Fig. 2(c). The observed result implies that the scaling 
law in human trajectories may results from the inherent 
hierarchical organization in traffic systems. 

Although the present model can reproduce the power- 
law displacement distribution, the absolute value of ex- 
ponent is higher than the empirical ones [U, Q , and the 
model is obviously oversimplified. Firstly, real cities are 
not located in a completely uniform matter, but with 
some irregularity. Secondly, the model assumes that each 
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FIG. 5: (Color online) Displacement distribution, P(L), for 
inhomogeneous model, obtained by 100 independent runs, 
each of which lasts 10 5 time steps. The parameters are N = 5, 
M = 9, K = 9 and S = 100. Red circles, green triangles and 
violet rectangles correspond to the cases of r = 2.0, r — 1.5, 
and r — 1.0, respectively. The blue lines denote the fitting 
functions, f(L) = 0.32(L + 0.52)" 151 e" i/6 ° for r = 2.0, and 
f(L) = 0.53(L+0.50)- 2 ' 70 e~ L/55 for r = 1.0. The inset shows 
the relation between the power-law exponent f3 and the pa- 
rameter r. 



city has the same attraction for the walker, however, in 
the real world, a central city is generally much more at- 
tractive than a small town. We next propose a modified 
model taking into consideration the inhomogeneous lo- 
cations and attractions of cities. In this inhomogeneous 
model, all cities are randomly distributed in an S x S 
square (we keep the number of cities in each layer the 
same as the original model), each of the nth-layer cities 
(2 < n < N) is connected to the nearest higher-layer 
city, and two nth-layer cities are connected if they are 
connected to the same higher-layer city. All the K lst- 
layer cities are fully connected to each other. 

As mentioned above, the center city should have 
greater attraction, which is represented by a layer- 
dependent weight, w n — r N ~ n , where n denotes the 
layer and r > 1 is a free parameter. The probability 
that the walker will move along an edge is proportional 
to its weight (A similar weighted random walk model 
has previously been proposed to explain the nonlinear 
dependence of the airport throughput on the connectiv- 
ity [Hj]). Clearly, the larger r indicates higher hetero- 
geneity. In Fig. 3, we show an illustration of a typi- 
cal trajectory in a 3-layer inhomogeneous model, and in 
Fig. 4 we report the trajectories for r = 1 and r = 2 
in a 5-layer inhomogeneous model. As shown in Fig. 5, 
for the inhomogeneous model, the displacement distri- 
bution, P(L), is still heavy-tailed and can be well fitted 
by a power-law function with an exponential cutoff, as 
P(L) = c(a + L)^ 13 e~ x / Xc . In addition, as shown in 
the inset of Fig. 5, when r increases from 1.0 to 2.0, 
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FIG. 6: (Color online) The probability W(d, t) of having trav- 
eled a distance d at time t. The parameters are N = 5, M = 9, 
K = 9, S = 100 and r = 2.0. This plot is obtained by av- 
eraging 1000 independent runs, each of which lasts 10 4 time 
steps. The black dash line, as a guide of eyes, is of slope 1. 



the power-law exponent, j3, monotonously decreases from 
2.70 to 1.51, covering the range of empirical observations 
[3, Q. This result suggests that the inhomogeneity in 
cities' attractions may be the reason why the absolute 
value of power-law exponent in the real human displace- 
ment distribution is lower than that predicted by the 
homogeneous model (i.e., 2.0), while the inhomogeneity 
of cities' locations enlarges the absolute value of such ex- 
ponent. 

Finally, we check whether our model can reproduce the 
spatiotemporal statistics of real human mobility. Provid- 
ing the trajectory of a random walker, one can obtain 
the probability W(d, t) of having traveled a distance d at 
time t (the same technique has been adopted in preparing 



Fig. 2a in Ref. please see details there). As shown 
in Fig. 6, a scaling behavior r(t) ~ t a with a « 1.0 
is clearly observed, which agrees well with the empirical 
result, a » 0.95, reported in Ref. Similar scaling 

behavior can also be observed for r = 1.0, however, the 
exponent, a rs 0.5, is far less than the empirical value. 
In addition, providing the travel displacement distribu- 
tion, this scaling behavior with a around 1.0 can not be 
reproduced by a Levy flight [1| . 

Uncovering the human traveling pattern is of funda- 
mental importance in the understanding of various spa- 
tiotemporal phenomena 0, [f| , and may finds applications 
in the design of traffic systems [l9| , the control of human 
infectious disease [2(j, the military service planning [2lj . 
and so on. Although empirical results about the scal- 
ing law of long-range human travels have been reported 
for years, it lacks the understanding of the underlying 
mechanism. This work gives raise to a very possible rea- 
son causing the heavy-tailed displacement distribution, 
P(L), that is, the hierarchical organization of traffic sys- 
tems. The secondary ingredient, also playing apprecia- 
ble role in determining the traveling pattern, is the in- 
homogeneities of the locations and attractions of cities: 
The former enlarges the exponent (3, while the latter de- 
presses it (essentially, the inhomogeneity of attractions 
results from the inhomogeneous population density and 
economic development). Actually, as shown in Fig. 5, 
with tunable strength of inhomogeneity, the exponent (3 
is also tunable. When r = 2, meaning the topper-layer 
cities having greater attractions, the statistical features 
produced by our model are very close to the empirical 
ones reported in Ref. Q, not only the displacement dis- 
tribution, but also the spatiotemporal statistics of mo- 
bility. 
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